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ABSTRACT 

All major on-line social networks, such as MySpace, Face- 
book, Live Journal, and Orkut, are built around the concept 
of friendship. It is not uncommon for a social network par- 
ticipant to have over 100 friends. A natural question arises: 
are they all real friends of hers, or does she mean something 
different when she calls them "friends?" Speaking in other 
words, what is the relationship between off-line (real, tradi- 
tional) friendship and its on-line (virtual) namesake? In this 
paper, we use sociological data to suggest that there is a sig- 
nificant difference between the concepts of virtual and real 
friendships. We further investigate the structure of on-line 
friendship and observe that it follows the Pareto (or dou- 
ble Pareto) distribution and is subject to age stratification 
but not to gender segregation. We introduce the concept of 
digital personality that quantifies the willingness of a social 
network participant to engage in virtual friendships. 

Keywords: Online Social Network, Friend, Overlay, Demo- 
graphics. 

1. INTRODUCTION 

Massive online social networks (MOSN) have become in- 
creasingly popular in the last several years. The largest on- 
line social networks, such as Facebook, Linkedln, and MyS- 
pace, have tens and hundreds of millions of member accounts 
(some of which, however, can be dormant). Direct descen- 
dents of the Internet forums and chatrooms of the 1980s and 
1990s, they serve as venues for communication, socialization, 
cooperation, learning, electronic commerce, and even cyber- 
crime. 

All modern MOSNs have similar internal structure. Each 
MOSN can be represented as a mesh of interconnected nodes, 
where every node represents a member account, and the con- 
nections between the nodes represent relationships between 
the members. A member account typically represents one 
human user, but it is not uncommon for a single user to 
create multiple online personalities (e.g., for the purpose of 
separating business and leasure) or for a group of users (say, 
working together on a project or representing a closely con- 
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nected social group) to create a single member account. For 
the rest of the paper we assume that each member account 
corresponds to one human user. 

A connection between two MOSN members is typically es- 
tablished if and when the two believe to share something in 
common, such as a common background, common friends, 
common interests and activities — or are interested in ex- 
ploring friends, interests, and activities of others. A connec- 
tion can be implicit — through online communities (member 
groups by interests and causes [121 1 13 j ^ — or explicit. Ex- 
plicitly, a MOSN member can have one or more so-called 
"friends' lists," to which the connections are added accord- 
ing to certain rules. It is mainly through the online ("vir- 
tual") friendship that the communications between the net- 
work members occur (which does not exclude the possibility 
of peer-to-peer communications between the members who 
are not formally virtual "friends"). 

In this paper, we will show that there is a significant dif- 
ference between the concepts of virtual and real friendships. 
We will present quantitative data from a real MOSN that 
demonstrate that not all "friends" on the Internet are real 
friends and that the overlap between the circles of offline 
friends and online friends exists but is not overwhelming. 
We will analyse the demographics of the online friendship. 

In addition, we introduce the concept of quantitative per- 
sonality that measures an MOSN user's willingness to en- 
gage in virtual friendships, and discuss its relevance to the 
traditional offline friendship. 

2. RELATED WORK 

The significant difference between virtual and real friend- 
ship as such has been explored in [T], however, no quantita- 
tive evidence was presented. 

The mechanisms of "friendship" allow the users to estab- 
lish new connections and enable social searching (locating 
and maintaining offline connections online, in a virtual set- 
ting, to learn more about them, date with them, or even 
engage in casual sex) [7;. It has been shown that through 
"friendships" users impact their friends' decisions [2], af- 
fect the predictability of the friends' actions and adopt be- 
haviours exhibited by their friends [3], and influence the be- 
havior patterns of their friends [U [11] . 

Some numerical data that describe the composition of the 
online friends' body and the relationship between the on- 
line and offline friendship have been presented in [4| [5] [9]. 
The insignificance of the online/offline overlap is mentioned 
in [9]. Colder et al. [5] study the distribution of number of 
friends per user in Facebook and explain that "people add 
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friend links for a variety of reasons, not always for reasons 
that imply the pair are friends in the conventional sense." 
Finally, Gilbert et al. [3] explore the gender aspect of online 
friendship and separate online friendships into "strong ties" 
and "weak ties." However, the separation is based on the 
friend-to-friend message statistics rather than on the social 
and historical origins of the friendship links. 

3. FRIENDSHIP IN ONLINE SOCIAL NET- 
WORKS 

While initially the meaning of a "friendship" relationship 
in MOSNs was to represent a personal friendship ("affection 
arising from mutual esteem and good will" |14]). the simplic- 
ity of befriending people online soon distorted the original 
intention. Being a friend in a MOSN can be considered a 
necessary but not a sufficient condition for being a true on- 
line friend [5]. To distinguish between true friends (online 
of offline) and registered online "friends," we will refer to the 
latter as contacts. 

The reasons that stimulate members to engage in online 
"friendships" range from real sincere friendship to the inabil- 
ity to say "no" and the desire to explode popularity [I]. In 
many online social networks (e.g., Live Journal) "friendship" 
is not even reciprocal, that is, there is no need to obtain 
someone's permission to declare her as a "friend." As a re- 
sult, it is not uncommon for up to 15% of MOSN members 
to have more than 100 contacts (to have the degree of over 
100, Figure 1). We claim that most of these contacts are not 
friends in the conventional sense. In fact, all relationships 
can be roughly separated into three groups: true friends 
(strong tie, according to [1]), good acquaintances (medium 
strength ties), and random acquaintances (weak ties). 

To validate our hypothesis, we used two subsets (groups) 
of members of Odnoklassniki.Ru (Classmates) — the leading 
Russian-language MOSN. The group A contains ca. 500 
thousand members, or 5% of the total MOSN population 
(at the time of writing), together with all the "friendship" 
connections among the members and the basic demographics 
(age and gender). 

Note that group A has an unusual — geometric — node de- 
gree distribution (Figure [TJ. More often, major social net- 



works follow the Pareto or double Pareto power law [51 110]. 

The other group B (the subset of A) contains 150 mem- 
bers with additional information about the nature of their 
"friendships." This additional information was obtained by 
interviewing ca. 1000 network members, of which only a 
small fraction (15%) responded. Each member in the group 
B was asked the following four questions (in Russian): 

1. How many real friends or relatives do you have on your 
contact list? 

2. How many of the remaining members on the list are 
good acquaintances of yours? 

3. How many of the remaining members on the list are 
random acquaintances of yours? 

4. How many of your real friends do not have an account 
at Odnoklassniki.Ru or are not on your contact list? 

The purpose of the fourth question was to estimate the 
fraction of the overall true friends' body that is also on the 
member's MOSN contact list. It was left up to the respon- 
dents to decide which contacts belonged to what group. 

4. ONLINE FRIENDSHIP AS EXTENDED 
FRIENDSHIP 

The results of the poll are shown in Figure [2] The relative 
sizes of the blocks are proportional to the average numbers 
of MOSN members in the corresponding classes. 

An average group B member has 48 contacts (vs. 60 con- 
tacts on the list of a group A member). Of them, only 
F on — 12 (or 25%) are recognized as real friends. More than 
50% of contacts are good acquaintances (F ga = 25). The re- 
maining F ra = 11 contacts are considered random acquain- 
tances. There is an exceptionally low correlation between 
the total number of contacts F and Fga and low negative 
and positive correlations between F and Fon and F and Fra, 
respectfully. This means that the fraction of good acquain- 
tances does not depend on the size of the "first circle" of 
contacts, while the fraction of good friends slowly diminishes 
at the expense of the gradually growing fraction of random 
acquaintances. The observed behavior is a variation of the 
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Figure 3: Experimental and theoretical age differ- 
ence distribution in all friendships among the group 
A members (fraction of the number of pairs with 
age difference X against the age difference) 

"rich get richer" scheme, where random acquaintances — the 
"true friendship' ballast" — are the equivalence of wealth. 

Another interesting observation is that an average group 
B member has a total of 20 real friends. Of them, 12 friends 
(60%) are on the MOSN contact list and eight are not, i.e., 
the online "friendship" somewhat absorbs traditional friend- 
ship (and also substantially extends it). The fraction of 
"offline" friends has a substantial negative logarithmic cor- 
relation r — —0.3 with the number of contacts suggesting 
that the MOSN members who are socially active online ei- 
ther pull their offline friends into the MOSN or have fewer 
offline friends in the first place. 

We do not have data to compare the online and offline 
populations of good and random acquaintances. 

5. AGE STRATIFICATION 

The demographics of the online friendship in the group 
A is subject to strong age stratification: the members of 
a certain age prefer to contact members of approximately 
the same age. If there were no age preferences among the 
members, then we would expect about 10 times fewer same- 
age relationships and 10 times more relationships with larger 
age difference than we observe in the group A (Figure [3]). 

Figure [4] shows the distribution of group A contacts' age 
(the maxima, or "ridges," are at the thick black lines). Ac- 
cording to the chart, all group A members can be roughly 
separated into four age categories: 

1. Ages 11-14 mainly befriend peers ("Ridge 1") and mem- 
bers in the range 19-30 year (teenage drive, "Ridge 3"). 

2. Ages 15-41 mainly befriend peers (classmates, school- 
mates, comrades-in-arms, "Ridge 1"), as well as mem- 
bers in the lower teens ("Ridge 2") and in the lower 60s 
("Ridge 5"). 

3. Ages 42-52 mainly befriend peers ("Ridge 1"). 

4. Ages 53 and older mainly befriend members in the 20s 
(children, grandchildren, "Ridge 4"). 




Figure 4: Age stratification in group A: ages of a 
X's contacts against X's own age; local maxima are 
at thick black lines 

The "Plateaus" surrounding the main diagonal in the range 
from 25 to 55 years probably represent co-workers in their 
productive ages. 

We were not able to detect gender-based segregation within 
group A: 48% of the relationships are intergenderal, 29% are 
between females, and 23% are between males. 

6. QUANTITATIVE PERSONALITY 

To quantify the relative importance of a network member 
(say, "Alice") among her immediate contacts, we introduced 
a quantitative personality If — a measure of social popular- 
ity /marginality [T5]. Let e be all of Alice's immediate con- 
tacts (her "first circle"). Then her quantitative personality 
II is the logarithm of the ratio of the average number of her 
contacts' contacts p(e) to the number of her own contacts 
p(A). If Mi is Alice's contact and p(Mi) is the number of 
Mi's contacts, then: 

II = \og\^p{M i )/p{A)]/p{A) = logEpfMi) - 2Iogp(A). 

The logarithmic function is used to restrict the range of 
the ratio that otherwise could be very broad. 

The positive values of II mean that p(e) > p(A), i.e., 
Alice's contacts have on average more contacts than Alice 
herself, and Alice is socially marginal. Conversely, if II < 
then Alice has more contacts than her neighbors and she is 
socially popular. II = is the case of equilibrium when Al- 
ice is an average MOSN member. It has been shown in |15| 
that the marginals outnumber the popular members approx- 
imately by the factor of 10. 

We compared the values of II calculated through the anal- 
ysis of the MOSN, with our experimental data to determine 
if this online parameter is relevant to the nature of the on- 
line friendship relation and to the members' offline behavior. 
What we discovered was a reasonably strong negative linear 
correlation between II and the number of online friends F on 
(r = —0.42), good acquaintances F ga (r = —0.5), and ran- 
dom acquaintances F ra (r = —0.42). This is consistent with 
the definition of II: smaller II means more contacts. 

However, the correlation between II and the number of of- 
fline friends Foff is close to zero: the quantitative personality 



is probably not related to offline friendship. 

7. CONCLUSION 

The concept of friendship is fundamental to all major on- 
line social networks. However, ridiculously large numbers 
of online friends that we observe in a typical massive on- 
line social network (MOSN) make us question the validity 
of the term "friendship." By interviewing a randomly cho- 
sen group of members of Odnoklassniki.Ru (a major Rus- 
sian social network), we demonstrate that online friendship 
is an umbrella name for real friendship and various degrees 
of acquaintanceship and that on average only 25% of con- 
tacts are recognized as real friends. More popular MOSN 
members tend to have fewer real offline-only friends (either 
because they drag their conventional friends into the MOSN 
or because they had inherently fewer offline friends). 

We observe strong age stratification in online "friendship": 
all MOSN members roughly fall into four age ranges with 
range-specific age preferences; the members of the two largest 
of the ranges, 15-52 years, prefer to befriend people of about 
the same age. 

We apply the concept of quantitative personality proposed 
in an earlier paper to the members of Odnoklassniki.Ru, and 
discover that it is not correlated with the number of real 
offline friends, which probably means that it is an online- 
only attribute of social behavior. 
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